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ESTIMATION FOR LEVY PROCESSES FROM HIGH FREQUENCY 
DATA WITHIN A LONG TIME INTERVAL 

By Fabienne Comte and Valentine Genon-Catalot 

University Paris Descartes, MAPS 

In this paper, we study nonparametric estimation of the Levy 
density for Levy processes, with and without Brownian component. 
For this, we consider n discrete time observations with step A. The 
asymptotic framework is: n tends to infinity, A = A„ tends to zero 
while nAn tends to infinity. We use a Fourier approach to construct 
an adaptive nonparametric estimator of the Levy density and to pro- 
vide a bound for the global L^-risk. Estimators of the drift and of 
the variance of the Gaussian component are also studied. We discuss 
rates of convergence and give examples and simulation results for 
processes fitting in our framework. 

1. Introduction. Let {Lt,t > 0) be a real-valued Levy process, that is, 
a process with stationary independent increments and cadlag sample paths. 
The distribution of {Lt ,t >0) is completely specified by the characteristic 
function ipt{u) = K{expiuLt) of the random variable Lj which has the form 

(1.1) iptiu) =exj)t(iub--u^a'^ + I (e*""" - 1 - mxlu|<i)iV(dx) ) , 
V 2 Jr/{o} J 

where 6 £ M, o"^ > and N{dx) is a positive measure on K/{0} satisfy- 
ing /jg/^Qj A lN{dx) < oo [see, e.g., Bertoin (1996) or Sato (1999)]. Thus, 
the statistical problem for Levy processes is the estimation of its character- 
istic triple (6,cT^,A^) where appears a finite-dimensional parameter (6,0"^) 
and an infinite-dimensional parameter A^, the Levy measure. In most recent 
contributions, authors consider a discrete time observation of the sample 
path, with regular sampling interval A. Therefore, statistical procedures 
are based on the i.i.d. sample composed of the increments [Z^ = = 
LkA — -^^(fc-i)A;^ = l,•••,?^)• In the general case, the distribution of the 
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r.v. is not explicitly given as a function of {b,a'^,N). This is why authors 
rather use the relationship between the characteristic function of and 
the characteristic triple. Assuming that N{dx) = n{x) dx admits a density, 
several papers concentrate on the estimation of the Levy density under vari- 
ous assumptions on the characteristic triple, including the case of 6 = o"^ = 
or assuming stronger integrability conditions on the Levy density [see, e.g., 
Watteel and Kulperger (2003), Jongbloed and van der Meulen (2006), van 
Es, Gugushvili and Spreij (2007), Figueroa-Lopez (2009) and the references 
therein, Comte and Genon-Catalot (2009, 2010a, 2010b)]. The joint estima- 
tion of {b,a'^,N) is investigated in Neumann and Reiss (2009) or Gugushvili 
(2009). The methods and results differ according to the asymptotic point 
of view. One may consider that the sampling interval A is fixed and that n 
tends to infinity (low frequency data). This approach, which is quite natural, 
raises mathematical difficulties and does not take into account the under- 
lying continuous time model properties. One may consider that A — 
tends to as n tends to infinity (high frequency data). Under the assump- 
tion that A„ tends to within a fixed length time interval (nA„ = t fixed) , 
the estimation of a has been widely investigated for Levy processes [see, 
e.g., Woerner (2006), Barndorff-Nielsen, Shephard and Winkel (2006), Ja- 
cod (2007)]. However, the Levy density cannot be identified from observa- 
tions within a finite-length time interval. To identify all parameters in the 
high-frequency context, one has to assume both that A„, tends to and 
nA„ tends to infinity. This is the point of view adopted in this paper. Our 
main focus is the nonpar ametric estimation of the Levy density n(-) by an 
adaptive deconvolution method which generalizes the study of Comte and 
Genon-Catalot (2009). We also study estimators of the other parameters. 
More precisely, we assume that the Levy density satisfies 

(HI) / x'^n{x) dx < oo. 

Jr 

For statistical purposes, this assumption, which was proposed in Neumann 
and Reiss (2009), has several useful consequences. First, for all t, EL'f < 
+00 and as J^ie^^^ — 1 — iux)n{x)dx is well defined, we get the following 
expression for (1.1): 

(1.2) il^tiu) =expt^ub -^u^a'^ + J {e''''^ - 1 - iux)n{x) dxj , 

where b = KLi has a statistical meaning (contrary to b). 

In Section 2, we present our main assumptions and some preliminary 
properties. In Section 3, we assume that a = and study the estimation of 
the function h{x) = x'^n{x). Using a sample of size 2n, we build two col- 
lections of estimators {hm,hm)m>o indexed by a cut-off parameter m. The 
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collections are obtained by Fourier inversion of two different estimators of 
the Fourier transform h* of the function h. The estimators of h* are built 
using empirical estimators of the characteristic function and its first 
two derivatives. First, we give a bound for the L^-risk of {hm,hm) for fixed 
m. Then, introducing an adequate penalty, we propose a data-driven choice 
of the cut-off parameter which yields an estimator {hm,hfh) for each col- 
lection. The L^-risk of these estimators is studied. We discuss the rates of 
convergence reached on Sobolev classes of regularity for the function h. In 
Section 4, we consider the general case. To reach the Levy density and get 
rid of the unknown , we must now use derivatives of ip/\ up to the order 3 
and we estimate the function p{x) = x^n{x) developing the Fourier inversion 
approach and adaptive choice of the cut-off parameter as for h. It is worth 
stressing that the point of view of small sampling interval is crucial to our 
study. Indeed, it helps obtaining simple estimators of ipA and its successive 
derivatives which are used to estimate the Fourier transform p* of p. Section 
5 is devoted to the estimation of {b,a). We study classical empirical means 
of the observations. This gives an estimator of b but cannot give estima- 
tors of cj. To estimate a, we consider power variation estimators, introduced 
in Woerner (2006), Barndorff-Nielsen, Shephard and Winkel (2006), Jacod 
(2007), Ai't-Sahalia and Jacod (2007), under the asymptotic framework of 
high frequency data within a long time interval. In Section 6, we give exam- 
ples of Levy models satisfying our set of assumptions. We provide numerical 
simulation results in Section 7. Section 8 contains the main proofs. In the 
Appendix, two classical results, used in proofs, are recalled. 



2. Assumptions and preliminary properties. Let us consider the two 
functions 

h{x) = x'^n{x), p{x)=x^n{x), 

and the assumptions 

(H2XA;) / |x|''n(x)(i2; < oo. 



or 



(H3) h belongs to L2( 

(H4) jA?i.),. = jAH.)i.<^ 

{ (H5) p belongs to L2(M) 

|(H6) JxVix)dx = lxVix)dx<oc. 
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Assumption (H2)(A;) is a moment assmnption. Indeed, according to Sato 
[(1999), Section 5.25, Theorem 5.23], E|Lj|'' < oo is equivalent to 
i|a;|>i l^l^iT-i^) < oo. Below, for each stated result, the required value of k 
is given. Under (HI), the function h is integrable and Section 3 is devoted to 
the nonparametric estimation of h under the additional assumptions (H3)- 
(H4) when cj^ = 0. Assumption (H4) is only required for the adaptive result. 
Under (H1)-(H2)(3), the function p is integrable and Section 4 concerns the 
estimation of p under (H5)~(H6) when cr^ ^ 0. 

Properties of the moments of La = = Z\ for small A are used in the 
proofs below. 

Lemma 2.1. Let k>l be an integer and assume (Rl)-(R2){k) with 
k = 3 (ork>3). Then, EdZi]*^) < +oo andK{Zi)=bA, Var(Zi) = A(a2 + 
/ x^n(x) dx) and for 3<£<k, E(Zf) = Aq + o(A) where ce = J x^n{x) dx. 

Thus, under (HI), (H2)(/c), E(Zf/A) is bounded for ah i<k, for ah A. 
In the sequel, results on the behavior of the characteristic function t/^a 
[see (1.2)] for small A are needed. 

Lemma 2.2. Under {El), \'il)/^{u) — l\< lS.\u\{c{u) + a'^\u\) where c{u) = 
\h\ + I /q" |/i*(f )| h*{v) = J e™^h{x)dx denotes the Fourier transform of 
h. If h* is integrable on M, then 

\ip/i{u)-l\<A.\u\{\h\ + \h*\i + \u\a'^). 

Proof. By formula (1.2), under (HI), V'A is with V'aC^) = ^'0A(ii) x 
{4>W) — c^u), where we have set, using that e*"^ — l = ix e*^^ dv, 

(2.1) (t){u) = ib- / h*{v)dv. 

Jo 

We have \<p{u)\ < |6| + 1 Jq \h*{v)\ dv\ and by the Taylor formula, 'il)^{u) — 1 = 
uip'^{cuu) for some c„ € (0, 1). The result follows. □ 

3. Case of no Gaussian component. In this section, we consider the case 
0"^ = and focus on the nonparametric estimation of h. For reasons that 
will appear below, we suppose that we have at our disposal a 2n-sample, 
{Zk)i<k<2n, with Zk = Z^ = LkA - L{k-i)A- We assume that A = A„ tends 
to and nA„ tends to infinity. Hence, A and Z^ depend on n. However, to 
simplify notation, we omit the dependence on n and simply write A,Zk- 
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3.1. Definition of estimators depending on a cut-off parameter. For a com- 
plex valued function / belonging to (M) , we denote its Fourier transform 
by f*{u) = J e*"^/(x)(ix. For integrable and square integrable functions /, 
/ii /2) we use the following notation: 

\fix)\^dx, {fuf2)= j fl{x)f2{x)dx 

(z denotes the conjugate of the complex number z). We have: = 
27r/(-x) and (/i,/2> = l/(27r)(/r , /|). 

By formula (1.2), under (HI), is and we have, as ci^ = [see (2.1)]; 



iA I 6 + / h{x) dx 1 = A(/>(ti) 



V'A(^i) 
Derivating again gives 

(3.1) , i/V.lMV>AM-W>kM)' 

where, for all u, limA-5.0 ^a('u) = 1- By splitting the 2n-sample into two inde- 
pendent subsamples of n observations, we introduce the following empirical 
unbiased estimators of V'A , V'a ' V'a • 

qn 

^A?.H = - E j = o,i,2,g = i,2. 

k=l+{q~l)n 



We also define, based on the full sample, the estimator of 

2n 

2n 



2n 

/n ^ — ' 



k=l 

We now build estimators of the Fourier transform h* of /i. Considering the 
expression of h* in (3.1), we replace '0 A , V' a ' V' a numerator by the 

empirical estimators built on the two independent subsamples of size n. In 
the denominator, ip'^ is simply replaced by 1. This yields 

(3.2) k{u) = 1(^« (n)V;£i W - A°^2(^))- 

Hence, using independence of the two subsamples, 

^h*{u) = ^((Vk(^))' - i^'k{u)i^A{n)) = h*{u) + h*{u){^l{u) - 1). 

Introducing a cut-off parameter m, we define an associated estimator of h 

1 

2^ 
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This means that h*^{u) = /j*(u)l[-7rm,7rm](^^)- By integration, the fohowing 
expression is available: 



f . . _ 1 \- (72 y y sin(7rm(Zfc + Zj+„-a;)) 

We also define another estimator of h* of h by setting 

1 



(3.3) -h*iu) = -U^l\u) 



Here, using (3.1), we get 

(3.4) Eh*{u) = -^i^'Hu) = h*iu) + h*{u){'iPA{u) - 1) - AijA{u)(l)\u). 
Thus, h* is simpler but has an additional bias term. We set 

2n. 



(3.5) h^[x)-^^J_^j ^ ^^i^^-anA^^'^ vr(Zfc-x) 



3.2. Risk for a fixed cut-off parameter. Next, let us define 
hm{x) = — e-'^''h*{u)du. 

J—Tvm 

Then we can prove the following result. 

Proposition 3.1. Assume that (Hl)-(H2)C^j and (H3) hold. Then 
Edlft™ - hf) < \\h„ -hf + 72E{Zf /A)- 



(3.6) 

+ / u^c\u)\h*{u)\'^du 

^ J —irm 

m'hn. - hf) < \\hm - hf + E{Zf/A)^ 

(3.7) 

2A2 z"'^™ 



)A2 rnm 

- — / u^c^{u)\h*{u)\'^du + CA'^B„ 



with C a constant, c{u) is defined in Lemma 2.2, = (2/7r) l'^(''^)l^ du 
[see (2.1)] satisfies Bm = 0{m) €Li(R) andBm = 0{m^) otherwise. 

Remark 3.1. We stress that the estimator km is more complicated to 
study, but hm. has an additional bias term. 
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3.3. Rates of convergence in Soholev classes. The following result con- 
cerns classes of functions h belonging to 

(3.8) C(a,L) = |/G (LinL2)(M),y" {I + u^Y\f* {u)\'^ du < L^. 

Proposition 3.2. Assume that {Rl)-{Yi2)(4) and (H3) hold and that 
h belongs to C{a,L) with a > 1/2. Consider the asymptotic setting where 
n — )■ +00, A — )• 0, nA — )■ +00 and assume that m < nA. If nA^ < 1, then, 
for the choice m = 0((nA)^/(^""'"^^), we have 

E{\\h^ - hf) < 0((nA)-2'^/(2'^+i)). 

If a>l, the condition nA^ < 1 can be replaced by uA"^ < 1. The same result 
holds for hm- 

Remark 3.2. We can also discuss the case where a E (0,1/2]. If a < 
1/2, I |/i*(f )| = 0(|ti|^/^~"). Hence, the last term in (3.6) is of order 
A^m^ which is less than m~^" if A^m^~^" < 1 and thus A^m^ < 1. This 
requires nA^^^ < 1. The same holds for hm- 

Note that no lower bound result is available for this problem. A bench- 
mark for comparison could be the problem of density estimation for i.i.d. 
observations without noise: if the density / belongs to C{a,L), the optimal 
minimax rate is of order 0(n~^"/(^'^"'~^)) [see Ibragimov and Khas'minskij 
(1980)]. 

3.4. Model selection. The estimators hm,hm are deconvolution estima- 
tors that can also be described as minimum contrast estimators and pro- 
jection estimators. For details, the reader is referred to Comte and Genon- 
Catalot (2009, 2010b). For m > 0, let 

S^ = {f £ L^(M),support(/*) C [-7rm,7rm]}. 

The space Sm is generated by an orthonormal basis, the sinus cardinal basis, 
defined by 

^m,j{x) = Vm^imx - j), j G Z, ip{x) = {ip{0) = 1). 

This is due to the fact that (p'^j{u) = (e*"-'/'"/-v/m)l[-7rm,7rm](^^))i £ ^- For 
a function / gL2(M), f^{x) = (27r)-^ e-*"^/*(ii) dn is the orthogonal 
projection of / on Sm- Introducing, for a function t G Sm, 

7n{t) = \\tf--{h*,t*) = \\tf-2{hrr,,t), 

vr 
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we get 



hm = arg mill 7„(t), 



and ^n{hm) = — ||/^^m|P- We have 

hm = 'Y^ am,j^m,j with am,j = ^ / ^* {'^)'Pm,j {-u) du 

jeZ ^ J -inn 

and ||/im|P = 1/(271") |/i*(n)p (in. The coefficients Omj of the series as 

well as 1 1 /imp can be explicitly computed by integration. In the same way, we 
set 



and obtain 



rn{t) = \\tf--{h*,t*) = \\tf-2{h,n,t), 



hra = arg min r„(t). 

t^Sm 



Analogously, hm has a series expansion on the sinus cardinal basis with ex- 
plicit coefficients and ||/im|P has a closed- form formula. We give the explicit 
expression of ||/im|P which is less cumbersome than 

(3-9) 11^-11' = 4;|k2 ^ Zlzf^{m{Zi,-Zi)). 

l<k,l<2n 

Now, we need to select the best m as possible, in a set A^„ = {m E N, 1 < 
m < nA} = {1, . . . , m„}. For the estimators hm, we propose to take 

(3.10) m = arg min (— + pen(m)) 

m&Mn 

with 

m 



pen(m) = k- 



n 



A2 



k=l / ^ k=n+l ^ k=l / 



The intuition for this choice is the following. The expression of pen(m) 
is an estimator of the variance term of the risk bound (3.6) as close as 
possible of the variance [see (8.2)]. The term — ||/im.|P is an estimator of 
— l|/im|P = \\h — /imp — l|/iP) which is up to a constant, the bias term of the 
bound (3.6). This is why m mimics the optimal bias- variance compromise. 
For the estimators hm, we define 

/ / 1 2n 

(3.11) m = arg min 

meMn 

The following result shows that the above data-driven choices of the cut-off 
parameter lead to an automatic optimization of the risk. 




ESTIMATION FOR LEVY PROCESSES 



9 



Theorem 3.1. Assume (H1)-(H2)(16)-(H3)-(H4). //, moreover, h* G 
L"'^(M) and ?iA^ < 1, there exist numerical constants k,k' such that 

Ed,;.. - .IP) < C__ W ^(||. - VIP + »(AE^(f ) +e(|)) ^) 



/\2 rnmn 

+ — u^\h*{u)\'^du + C 

H\\hin-hf)<C inf (\\h-hmf + K'¥.(^\ 



™" - - ln2(nA) 

7rm„ 



A j-Km-n 

+ — U^\h*{u)\^du + ^'^Brar,+C 

where -Bm„ =0{mn) (Bm^ is defined in Proposition 3.1). 

The numerical constants k, k' have to be cahbrated via simulations [see 
discussion in Comte and Genon-Catalot (2009)]. 

By computations analogous to those in the proof of Proposition 3.2, we 
obtain the following corollary. 

Corollary 3.1. Assume that the assumptions of Theorem 3.1 are ful- 
filled. If, for some positive L, h^C{a.,L) with a> 1/2, i/ien E(||/i^ — Zip) = 
C>((nA)-2'^/(2a+i)) provided thatnA"^ < 1. The same holds for K{\\hfh - hf) . 
If CL>1, the constraint nA^ <1 is enough. 

4. Study of the general case {a"^ 7^ 0). In this section, we assume (Hl)- 
(H2)(3) and study the estimation of the function 

p{x) = x^n(x). 

We suppose that we have a sample of size n, {Zk)i<k<n, ^fc = -^fcA — -^^(fc-i)A- 

4.1. Definition of the estimators. We compute the three first derivatives 
of [see (2.1)]: 



Derivating again gives 



ib-ua'^ + i / h{x) dx j = A{<f>{u) - ua^). 
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and last 



Let 



A V ^/\{u) i}j^{u) ^%{u) 



n 

P*{y) = with (n) = - ^{iZkfe'^^K 

k=l 



Then 



2vr riA ^ 7r(Zfc - x) 

Let us set 

(4.2) (^(-u) = - na^ = i6 - / /i* (^;) dv - ua^. 

Jo 

Using V'a(^) ~ A7pj\{u)(j){u) and some computations, we get 

Ep*(n) = {iPa{u) - l)p*{u) - 3iAiPA{u)4>{u){a^ + h*{u)) 

(4.3) 

+ iAVA(n)(<^(n))^ 

Remark 4.1. By a method analogous to the one used for h, considering 
a sample of size 3n, we can build another estimator of p* which is less biased 
but more complicated to study. 

4.2. Risk of the estimators. The risk of the estimator with fixed cut-off 
parameter is bounded as follows. 



Proposition 4.1. Under (H1)-(H2)(6) and (H5), 
E{\\p^-pf)<\\p-p^f + E{Zf/A)- 



(4.4) 



+ c 



(/■■Km \ 
A2 / n2(l + n2)|p*(n)|2dn + AW + AVM 
J —-Km I 



where Pm{x) = (27r) ^/T^c '^'^^p* {u) du denotes the orthogonal projection 
ofp on Sm- 



We can state the result analogous to the one of Proposition 3.2. 
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Proposition 4.2. Assume that (HI), (H2)(6), (H5) hold and that p 
belongs to C{a,L). Consider the asymptotic setting where n — )• +00, A — )• 
and nA — )• +00. If nA^-*^/^ < 1, then 

E(||p^-pf)<0((nA)-2a/(2a+l)^_ 

If a> 1/2, the condition uAJ/^ < 1 can he replaced by nA? < 1. 



4.3. Model selection strategy. The data driven selection of the best pos- 
sible m imposes here a restricted collection of models. We choose M„ = {m G 
N/{0},m<\/^ :=//„}. 

We can consider the estimator pfn where 

?fi = arg min ( — ||j5m||^ + pen(m)) 

(4-5) 



with pen(m) = ^'-^ ( ^ ^ Z| 



nA2 

We can prove the following result. 



Theorem 4.1. Under assumptions (HI), (H2)(24), (H5), (H6) and with 
nA^ < 1, there exists a numerical constant k such that (with fin = VnA ) 



n\\pin-p\?) 

<C inf Up-p^f + n'^(^\ 



nA 

+ C f ^ T" n2(l + u^W(u)\^du + AV^ + AV.I + '^^] . 

The consequence of Theorem 4.1 is that the adaptive estimators reach 
automatically the expected rate of convergence when p belongs to a Sobolev 
class. This can be seen by computations analogous to those of Proposi- 
tion 4.2. 



5. Parameter estimation. Under (HI), the observed process may be writ- 
ten as Lt = bt + aWt + Xf where {Wt) is a standard Brownian motion, (Xt) 
is a Levy process, independent of {Wt), of the form 

Xt= / x{p{ds,dx) — dsn{x) dx), 
J]o,t] Jr/{o} 

where p{ds,dx) is the random jump measure of (Lt) [and (^t)]. 
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If moreover j \ x\n{x) dx < oo, then Lt = bot + aWt + Tt where bo = b — 
f xn[x) dx and 

Tt= / xp{ds,dx) = Xt + t / xn{x) dx = y^rs — Ts_ 
J]o,t] Jr J 

is of bounded variation on compact sets. We consider here a sample of size n. 
By using empirical means of the data Z^, it is possible to obtain consistent 
and asymptotically Gaussian estimators of b {i = 1) and, under suitable 
integrability assumptions on the Levy density, of f x^n{x) dx for i>3. But 
this method fails to estimate a for i = 2 (see below). For this, one has to 
use another approach based on power variations. 

5.1. Some small time properties. To study estimators of b and a, small 
time properties of moments of La are needed. For simple moments, the 
result is stated in Lemma 2.1. For absolute moments, we refer, for ex- 
ample, to Figueroa-Lopez (2008): if J^^^^^-^^y\x\'^n{x) dx < +00, and r > 2, 

A~^E(|La|'') — > / |a::|''n(x) as A — t- 0. For the case of \x\^ with r < 2, we 
state the following proposition. 

Proposition 5.1. (i) Let (Ft) be a Levy process with no continuous 
component and Levy measure 77.(7)^7. // / |7|n(7) (i7 < 00, b = J 771(7)^7 
and for r <1, J |7|''n(7) d'y < 00. There exists a constant C such that, for all 
A, E|Fa|'' < CA. [Under the assumption, (Ft) has finite mean and bounded 
variation on compact sets.] 

(ii) Let Xt = B^t where (Tt) is a subordinator with Levy density nr sat- 
isfying b = jf^°° 7nr(7) ^7 < 00 and {Bt) is a Brownian motion independent 
of (Fj). The Levy measure of {Xt) has a density given by 

r+00 1 

(5.1) ''^^^^^/o ^"'^'^'^TS^''^^^^'^^- 

Consequently, if C = /q ' ^ Y^'^nri-i) dj<oo with r<2, then E\Xa\'' < CA. 

(iii) Let (Xt) be a Levy process with no Caussian component. Then X/^/ 
A converges to as A tends to in probability and in L'' for all r <2. 

5.2. Estimator of b. Consider a Levy process (Lt) satisfying (HI) and 
set Zf^ = Li^j\ — L(^i^_i^A as above. Let us define the empirical means 

^ n ^ n 

(5.2) b = — Y,Zk, Q = — for£>2. 

nA ^-^ uA ^-^ 

k=l k=l 

We prove now that b, ci,£>2 are consistent and asymptotically Gaussian 
estimators of the quantities b, ce,i> 2 where 
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Proposition 5.2. Assume (HI) and n tends to infinity, A tends to 0, 
nA tends to infinity. 

(i) Under (H2)(^2 + e) for some positive e, 

VnA(b — b) converges in distribution to N" (0,02). 

(ii) Under {}i2) (2{£ + e) ) for some positive e, and if nA^ tends to 0, 
VnA{ci — ce) converges in distribution to J\f{0,C2e)- 

We stress that this method provides an estimator of b which is easy to 
compute and very good in practice (see Section 7), but cannot provide an 
estimator of cj^. 

5.3. Estimation of a with power variations. Estimators of a based on 
power variations of (Lt) have been proposed and mostly studied in the case 
where nA = 1 . They are studied for high frequency data within a long time 
interval in Ai't-Sahalia and Jacod (2007). In the latter paper, the context 
is more general than ours, which implies that proofs are of high complex- 
ity. For Levy processes fitting in our set of assumptions, we can derive the 
asymptotic properties of power variations estimators with a specific proof 
given in Section 8. Consider the family of estimators of a given by 

n 

(5.3) a{r) = [a^]^/^ with = J] \Z,\^, 

fc=i 

where = E|X|'' for X a standard Gaussian variable (recall that = 
LkA — -t'(fc-l)A)- 

Proposition 5.3. As n tends to infinity, A tends to and nA tends to 
infinity, z/ nA^"'' = o(l), y/n{an'^ ~ converges in distribution to aM{0, 
cr'^'' {m2r / ml - 1)) for: 

(i) (Li) a Levy process satisfying (HI) and such that j\x\n{x)dx < 00 
and J \x\'^ n{x) dx < 00 for r < 1 . 

(ii) (Lt = bt + crWt + Xt), with Xt = B^^ , where W, B, T are independent 
processes, W, B are Brownian motions, T is a subordinator with Levy mea- 
sure ur satisfying b = /o^°° 7?^r (7) < 00 and Jfl~°° Y^'^^ri'y) d'y < 00 for 
r < 1. 

Consequently, ^/n{^^{r) — a) converges in distribution to aM{0, (cr^/'r^)(m2r/ 
ml -I)). 



14 F. COMTE AND V. GENON-CATALOT 

Table 1 

Rates for different "Drift + Browman motion + Compound Poisson processes" 
fix) AfiO,l) £W /3(1,3) 

p{x) = cx^f{x) ocx^e'''' oc2;^e"^l^>o oc a;^(l - 2;)^l[o,i] (2;) 

p*{u) (x{u^ -3u)e-''^/'^ ocl/(l-m)'' 0(l/|u|^) for large |u| 

lH>nJp'(^)\'d^ 0((™)^e-(-)') 0{{nm)-') 0((™)-^) 

/|„I<.M„^'|P*WI''^^ 0(1) 

m (best choice of m) y'log(nA) - § log log(nA) /tt 0((nA)^/**) 0((nA)^''*') 

Rateoc ^^^^^ (nA)-^/« (nA)-5/« 

For other cases of Levy processes, the result depends on the rate of con- 
vergence to of E|XAr/A''/2 [ggg Proposition 5.1(iii)] and wih stih hold if 
V^E|Xa|7A''/2 tends to 0. 

Remark 5.1. It is worth noting that the rate of convergence is ^/n. 

For r = 1, the estimator is consistent but not asymptotically Gaussian 
(because of its asymptotic bias). We have implemented these estimators 
for r = 1/2, r = 1/4 (see Section 7) for processes satisfying J \x\^n{x) dx < 
+00 for all positive r. Note that we always give integrability conditions on 
M for the Levy density. This simplifies the presentation but induces some 
redundancies. One should distinguish integrability conditions near and 
near infinity to avoid them. 

6. Examples. In this section, we give examples of models fitting in our 
framework. 



Example 1. Drift + Brownian motion + Compound Poisson process. 
Let 

1) Lt = bot + aWt + Y^Yi, 



i=l 

where Nt is a Poisson process with constant intensity c and 1^ is a sequence 
of i.i.d. random variables with density /, independent of the process (Nt). 
Then, X^^j^ Yi is a compound Poisson process and (Lt) is a Levy process 
with Levy density n{x) = cf{x). Note that ELi = 6 = 60 + / xn{x) dx. For 
the estimation of p, the rates that can be obtained depend on the density / 
provided that / satisfies the assumptions of Theorem 4.1, which are essen- 
tially here moment assumptions for the r.v.'s Yi. Any order can be obtained 
as shown in Table 1 where rates are computed for / a standard Gaussian, 
an exponential with parameter 1 and a Beta distribution with parameters 
(1,3) (for p to be regular enough). 

As J |a;|^n(x) dx <oo for all r < 1 (actually, for all r < 2), estimation of a 
is possible using a{r) for any value of < r < 1 [provided that nA^~^' = o(l)]. 
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Example 2. Drift + Brownian motion + Levy-Gamma process. 

Consider Lt = h^t + aWt + Tt where (r^) is a Levy gamma process with 
parameters (/3,q), that is, is a subordinator such that, for ah t > 0, Vt has 
distribution Gamma with parameters {(it, a) and density: a^* x^^~^ e~°'^ / 
r(/3t)]lx->o- The Levy density of (Lt) is n{x) = /3x-'^e-°'''l^yo. We have 
ELi = b = bo + f xn{x) dx and p{x) = /3x^e~°'^la;>o- 

We find p*{u) = 2/3/{a - iuf, j\^^^^Jp*{u)\^du = 0{m-^) and 

XT^^ u'^Ip* {u)\'^ du = 0(1). Therefore, the rate for estimating p is 0((nA)~^/®) 
for a choice m = 0((nA)^/^). 

As for all r > 0, / x'^n{x)dx < oo, a{r) is authorized, for any value of 
< r < 1, to estimate a. 



Example 2 (Continued). Drift + Brownian motion + A specific class of 
subordinators. 

Let Lt = bot + aWt + Lj where (Fj) is a subordinator of pure jump type 
with Levy density of the form n{x) = /3x^~^/'^x~^e~°'^lx>o with 6 > —1/2 
(thus, J xn{x) dx < oo). This class of subordinators includes compound Pois- 
son processes {5 > 1/2) and Levy Gamma processes {6 = 1/2). When 6 > 
0, the function xn{x) is both integrable and square integrable. This case 
was discussed in Comte and Genon-Catalot (2009) where the estimation of 
xn{x), when 5o = 0, a = 0, is studied. Here, we consider the case —1/2 < 
6 <0 which includes the Levy Inverse Gaussian process {6 = 0). Assump- 
tions (H1)-(H6) are satisfied. The function p{x) = x^n{x) can be estimated 
in presence (or not) of additional drift and Brownian component. We can 
compute 

,*(n)=/5-^(^ + ^/^) 



{a - iu)'5+5/2 ■ 

Thus, ij„|>^^ \p*iu)\'^ du = 0(m-(2'5+4)). As 2J + 1 < 1, u'^\p*{u)\'^ is not inte- 
grable and we have J\u\<nti„ W^Ip* iu)\'^ du = A^o(^„) = o(A^/^). The best 
rate for estimating p is 0((nA)-(2<5+4)/(25+5) ^ f^^. ^ choice m = 0((nA)i/(2<5+5))_ 
Note that A3/2 < („a)-(25+4)/(25+5) for ,^^2 < 1 and -1/2 < 5 < 0. 

We have J x^'n{x) dx <oo for r > 1/2 — 5. Hence, to estimate a using o"(r), 
we must choose 1/2 — 6 < r < 1. 



Example 3. Drift -I- Brownian motion -|- Pure jump martingale. 

Consider Lt = bt + aWt + -Bpt where W,B,T are independent processes, 
W, B are standard Brownian motion, and F is a pure-jump subordinator with 
Levy density nr(7) = /37''~^/^7~^e~"'''l^>o as above (assuming 5 > —1). The 
Levy density n(-) of (Lt) [and of (Xt = BrJ] is linked with nr [see (5.1)] and 
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can be computed as the norming constant of a Generalized Inverse Gaussian 
distribution 

2/3 / \x\ \^~^ 

where K,^ is a Bessel function of third kind (MacDonald function) [see, e.g., 
Barndorff-Nielsen and Shephard (2001)]. For 5 = 1/2, is a symmetric 
bilateral Levy Gamma process [see Madan and Seneta (1990), Kiichler and 
Tappe (2008)]. For 6 = 0, By^ is a normal inverse Gaussian Levy process [see 
Barndorff-Nielsen and Shephard (2001)]. The relation (5.1) allows to check 
that the function p{x) = x'^n{x) belongs to n and satisfies (H6) for 
(5 > — 3/4. Moreover, we can obtain 

^* ^""^ " "'^ ((a + n72)V2 " ^{a + u^/2fl^ ' 

Thus, ii„|>^„|p*(n)pdn = 0{m-^) and j^^^^^^y\p* {u)\'' du = 

A20(^„) = 0(A3/2). r^YiQ best rate for estimating p is 0((nA)^2/'^) ob- 
tained for m = 0((nA)^/'')). We have A^/^ < (^A)-^/'' as n/S? < 1. As 
J 7''/^nr(7) (i7 < oo for r > 1 — 5/2, the estimation of a by d{r) requires 
1 — 5/2 < r < 1. Therefore, we must have 5 > 0. 

7. Simulations. In this section, we present numerical results for simu- 
lated Levy processes corresponding to Examples 1 and 2 (see Section 6). 
For these models, the functions g{x) = xn{x), h and p belong to n L^(M). 
Thus, we can apply the method of Comte and Genon-Catalot (2009), to 
estimate g when 6o = 0, o" = 0, and the method developed here to estimate h 
when (7 = and p when cr 7^ 0. We have implemented the estimators hm-, Pm 
defined by (3.5)-(3.11) and (4.1)-(4.5). The numerical constant k' appearing 
in the penalties has been set to 7.5 for 5, 4 for h and 3 for p; its calibra- 
tion is done by preliminary experiments. The cutoff fh is chosen among 100 
equispaced values between and 10. 

Figure 1 shows estimated curves for models with jump part coming from 
compound Poisson processes [see (6.1)] where the 1^'s are standard Gaussian, 
Exponential <f(l), and /3(3,3) rescaled on [—4,4]. The intensity c is equal 
to 0.5. 

Figure 2 shows estimated curves for jump part of Levy Gamma and bilat- 
eral Levy Gamma type. The bilateral Levy Gamma process is the difference 
Ft — of two independent Levy Gamma processes. 

On top of each graph, we give the mean value of the selected cutoff with 
its standard deviation in parentheses. This value is surprisingly small. As 
expected, the presence of a Gaussian component deteriorates the estimation, 
which remains satisfactory on the whole. 
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Estimation of g{x) = xn(x) Estimation of h(x) = x^n{x) Estimation of p{x) = x''n(,x) 

bo = 0, o = bo = 0.25, = bo = 0.25, a = 0.5 

(al) A = 0,91 (0.03) (a2) A = 1.01 (0.05) (a3) A = 0.86 (0.19) 




(bl) m = 0.88 (0.08) (b2) m = 0.62 (0.09) (b3) m = 0.45 (0.09) 




2 4 6 8 10 12 14 ' 2 4 6 8 10 12 ' -2 2 4 6 8 10 12 



(cl) m = 0.60 (0.00) (c2) m = 0.75 (0.11) (c3) m = 0,87 (0.12) 




, I , : : : : , — U,UO ' ' . ' . . ' . ' " U , O ' . ' . ' 

-6-4-2 2 4 6 -8 -6 -4 -2 2 4 6 8 -6 -4 -2 2 4 



Fig. 1. VanabiUty bands for the estimation of g,h,p for a compound Poisson process 
with Gaussian (first line), Exponential f (1) (second line) and /3(3,3) rescaled on [—4,4] 
(third line) Yi 's, with c — 0.5. True (bold black line) and 50 estimated curves (dotted red), 
A = 0.05, 71 = 5.10''. 



We estimate the product of a power of x and the Levy density whereas 
other authors estimate n(-) on a compact set separated from the origin, see 
[12], Figueroa-Lopez (2009). Therefore, our point of view coincides with the 
usual one. Moreover we have, an obvious inequahty; setting n(x) = h{x)/x'^ 
as n{x) = h{x)/x'^, we get 

E(||(n-n)lK/[_,,,]||2)<^E(||/i-/i||2). 

Analogous inequalities hold for h{x) = g{x) jx or h{x) = p{x)/x'^ . In Figure 3, 
we plot the estimator of n(-) deduced by dividing by the correct power 
of X and by excluding an interval [—a, a] around zero. To obtain correct 
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Estimation of g{x) = xn{x) Estimation of h{x) = x^n{x) Estimation of p{x) = x^n{x) 

bo_= 0, a = 6i, = 0.25, o = 6o = 0.25, a = 0.5 

(al) Yh = 3.58 (0.36) (a2) rh = 0.93 (0.09) (a3) = 0.58 (0.09) 




I 1 1 1 1 1 1 — U.UJ I ' 1 1 1 1 1 — U. I I ' 1 1 ' 1 1 ' 1 1 1 

02468 10 12 02468 10 12 -1 0123456789 



(bl) m = 3.58 (0.27) (b2) m = 0.81 (0.11) (b3) m = 0.41 (0.04) 




-10 -5 5 10 15 -10 -5 5 10 15 -10 -5 5 10 15 



Fig. 2. Variability bands for the estimation of g,h,p for jumps from a Levy-Gamma 
process with P — l,a = 1 (first line), a bilateral Levy-Gamma process with 
(/3,a) = (0.7, 1), (/3', a') = (1,1) (second line). True (bold black line) and 50 estimated 
curves (dotted red), A = 0.05, n = 5.10*. 



representations, a = 0.1 suits for g{x)/x, a = 0.5 for h{x)/x^ and a = 1 for 
p{x)/x^. The results are satisfactory and in accordance with the difficulty 
of estimating n(-) without or with Gaussian component. 

Tables 2 and 3 show the means of the estimation results for h = E(Li) = 
6o + / xn{x) dx [see (5.2)] and cr, with standard deviations in parentheses. 

The estimation of b is good in all cases, and especially when nA is large. 
The estimation of a is clearly more difficult, with noticeable differences 
according to the values of n and A. When A is not small enough, the 
estimation can be heavily biased. In accordance with the theory, when r 
is smaller, the estimator of a is slightly better (smaller bias). Table 4 shows 
the values of nA^ and nA^"'', which should be small for the performance of 
the estimator to be satisfactory. It is worth noting that a is constantly over 
estimated. 

8. Proofs. 

8.1. Proof of Proposition 3.1. First, the Parseval formula gives \\hm — 
^IP = (l/(27r))||^m and we can note that h*{u)-h*^{u) = h* {u)l\^\y^^ 
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Estimation of n{x) = g{x)/x Estimation of n(x) = h(x)/x^ Estimation of n{x) = p{x)/x^ 
60 = 0, o = bo = 0.25, = 5o = 0.25, o = 0.5 

(al) (a2) (a3) 




2 4 6 8 10 12 2 4 6 8 10 ' -2 2 4 6 8 10 12 14 



(bl) (b2) (b3) 




) ' . ■ ■ ■ ■ ' ■ . ' -U.UO ' ■ ■ ■ ■ ■ ' r, r,c ' ' ' ■ ' ^ 

-5 -4 -3 -2 -1 1 2 3 4 -6 -4 -2 2 4 6 -OO^S _4 _2 2 4 



Fig. 3. Estimation of n{-)l[_a^a]': with a — 0.1 (first column), a — 0.5 (second column), 
a=l (third column). In all cases, A = 0.5, 71 = 50,000, A = 0.05; 25 estimated curves 
(thin dotted) + the true (bold line). 



is orthogonal to /ij^ — /ij^ which has its support in [— 7rm,7rm]. Thus, 

\\hn^-hf = ^{\\h*-h*J\'+\\h*rn-Kf). 

The first term (l/(27r))||/i* — /i^lP = 11^ ~ ^m|P is a classical squared bias 
term. Next, 

h*m{u) - Kniu) = {Kn{u) - ^Knim + [nKn{u)) " Kn{u)] 

= [k^{u) -¥.{hl,{u))] + - l]h*{u)t\u\<-,m- 

Bounding the norm of \\h*^ — by twice the sum of the norms of the two 
elements of the decomposition, we get 

E(||/^m-/i„^f ) < -E / \h*{u)-m*{u)\'^du\ 



vr 



vr 



-■Km 



+ 3 / 

-7rm 



vr 



7rm 



<-\ I \ai{h*{u))du 
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Table 2 

Estimation of {b,a), bo = 1, the true value of b in parenthesis, 
a = 0.5, K = 200 replications 

Model (Tt,A) (5.10'', 0.05) (5.10*, 0.01) (5.10", 10~^) (10*, 10~^) 



Poisson b{b=l) 1.000 (0.02) 0.997 (0.04) 0.995 (0.123) 1.001 (0.280) 

Gaussian '^(1/2) 0.602 (0.03) 0.527 (0.002) 0.504(0.002) 0.504 (0.005) 

a(l/4) 0.589 (0.03) 0.521 (0.002) 0.503 (0.002) 0.503 (0.002) 

Poisson 6(&= 1.5) 1.502 (0.05) 1.502 (0.051) 1.494 (0.142) 1.461 (0.359) 

Exp(l) ^(1/2) 0.611 (0.003) 0.530 (0.003) 0.505 (0.002) 0.505 (0.005) 

a(l/4) 0.594 (0.003) 0.522 (0.003) 0.503 (0.002) 0.503 (0.005) 

Gamma b(6 = 2) 2.001 (0.02) 2.000 (0.05) 1.998 (0.177) 2.018 (0.335) 

(1, 1) -^(l/a) 0.705 (0.004) 0.562 (0.003) 0.512 (0.002) 0.513 (0.005) 

a(l/4) 0.677 (0.004) 0.548 (0.003) 0.508 (0.002) 0.508 (0.005) 

Bilateral &(&= 1.4286) 1.426 (0.035) 1.4286 (0.076) 1.4493 (0.264) 1.405 (0.619) 

Gamma '^(1/2) 0.862 (0.005) 0.628 (0.004) 0.526 (0.003) 0.526 (0.006) 

(0.7, 1), (1.1) 17(1/4) 0.798 (0.004) 0.593 (0.003) 0.516 (0.002) 0.515 (0.006) 



Table 3 

Estimation of {b,a), bo = 1, the true value of b in parenthesis, a = 1, power variation 
method for estimation of a, K = 200 replications 

Model (",A) (5.10*, 0.05) (5.10*, 0.01) (5.10*, 10"^) (10*, 10"^) 



Poisson 
Gaussian 

Poisson 
Exp(l) 

Gamma 
(1, 1) 

Bilateral 
Gamma 
(0.7, 1), (1.1) 



&(1) 
a(l/2) 
a(l/4) 

6 (1.5) 
a(l/2) 
a(l/4) 

b{2) 
a(l/2) 
a(l/4) 

6 (1.4286) 
a(l/2) 
a(l/4) 



0.999 (0.025) 
1.082 (0.005) 
1.072 (0.005) 

1.510 (0.026) 
1.096 (0.005) 
1.080 (0.005) 

2.00 (0.026) 
1.172 (0.005) 
1.152 (0.005) 

1.425 (0.04) 
1.330 (0.006) 
1.284 (0.006) 



1.005 (0.059) 
1.026 (0.004) 
1.020 (0.005) 

1.498 (0.06) 
1.030 (0.004) 
1.022 (0.004) 

1.995 (0.068) 
1.062 (0.005) 
1.050 (0.005) 

1.431 (0.10) 
1.136 (0.005) 
1.105 (0.005) 



0.998 (0.178) 
1.006 (0.004) 
1.004 (0.004) 

1.481 (0.190) 
1.006 (0.004) 
1.003 (0.004) 

1.991 (0.196) 
1.014 (0.004) 
1.010 (0.005) 

1.429 (0.28) 
1.033 (0.005) 
1.022 (0.005) 



1.025 (0.85) 

1.005 (0.009) 
1.003 (0.01) 

1.485 (0.442) 

1.006 (0.009) 
1.003 (0.010) 

2.023 (0.195) 
1.014 (0.004) 
1.010 (0.004) 

1.492 (0.63) 
1.033 (0.01) 
1.022 (0.01) 



Table 4 

Values ofn,A, uA, nA^, nA^"'' for r = 1/2 and r = 1/4 



(n. A) (5.10*,0.05)(5.10*,0.01)(5.10*,10~"'')(10*,10-^) 



nA 


2500 


500 


50 


10 


nA^ 


125 


5 


0.05 


0.01 


nA^-^l^ 


559 


50 


1.6 


0.3 


nA^-i/* 


264 


16 


0.3 


0.06 



ESTIMATION FOR LEVY PROCESSES 21 
+ / uV(n)|/i*(n)|2dn 

^ J —1X1:11 

(see Lemma 2.2 for the upper bound of — 1| and note that iV'Al'ii)! ^ 

1). Now, we use the decomposition 

^{h*{u)-¥.{h*{u))) 

(8.1) + (^i\\(n) - ^^J'^{u))^PM + {i^^l],{u) - Vk(^))V'k W 

- (^g\(n) - ^1(«))(<)2(^) - ^a(^)) 

Considering each term consecutively and exploiting the independence of the 
samples, we obtain 

va..(r („)) < « + ^ + 

(8.2) 

<36 ^ V • 
nA 

Thus, the first risk bound (3.6) is proved. Analogously, we have 



■Km 



^{\\hm-hf)<\\hrn-hf + - I \Eh* (u) - h* iu)\^ du 

^ J — 7vm 

+ - / Var(/i*(n))dn. 



TT 



For the variance of h*(u), we use: h*{u) — E/i*(n) = —A ^(^/i^^(n) — ^/;^(u)). 
Thus, 

Var(r(^.))<^E(ZiVA). 

Next, for the bias of h*{u), we use [see first (3.4) and then (2.1)] 
\Eh*{u) - h*{u)\^ < 2|/i*(n)|2||V^A(n) - if + 2A^\^^{u)\. 
Hence, there is an additional term in the risk bound equal to 



(8.3) -AM |(/.*(n)|dn = A^5^. 

^ J —TTm 

If h* is integrable, |(/)(n)| < C and Bm = 0{m). Otherwise, |(/)^(u)| < C\u\^ 
and = 0{m^). 
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8.2. Proof of Proposition 3.2. As ||/i - /imp = (l/vr) du, 

the definition of C{a,L) imphes clearly that \\h — ^ (L/27r)(7rm)^^". 
The compromise between this term and the variance term of order m/{nA) 
is standard: it leads to choose m = 0{{nAy^^'^"'^^^) and yields the order 
0((nA)-2'^/(2'^+i)). 
For a > 1/2, we have 



\h*{v)\dv 



< 



j (1 + 7;2)-" 



dv < +00. 



Therefore, h* is integrable and \ct>{u)\ <\b\ + \h*\i. 
The last term in the risk bound (3.6) is less than 



u^\h*{u)f du < LA2(7rm)2(^-'^)+ . 



If a > 1 and nA^ < 1, we have A2(7rm)2(i"«)+ = A^ < (nA)^^ 
If a e (1/2, 1), the inequality A'^m?'^^""'^ < m~'^"- is equivalent to A^m^ < 
1. As m < nA, A^m^ < 1 holds if nA^ < 1. 

For the additional bias term appearing in the risk bound of hm, we 
have Bm = 0{m). Thus, mA^ < jn"^'^ holds, for m = 0((nA)^/(2°+i)), if 



m 



l+2a 



A^ = (nA)A^ < 1 which in turn holds if nA"^ < 1. 



8.3. Proof of Theorem 3.1. We only study hm as the result for hm can 
be proved analogously (and is even simpler). 

The proof is given in two steps. We define, for some q, < < 1, 



[(l/nA)ELi^|][(l/^A)Et.+i^, 



n 



(E(Z2/A))2 

[(i/^A)ELi^fe1 



1 



<£>/2 



(E(Z4/A)) 



<^>/2 



Hn,)+mhn^-h\\Hac). 

i|p]lf2c), we refer to the analogous proof 



so that E{\\hrh - hf) = E(||/im - 
Step 1. For the study oiE{\\h^ 
given in Comte and Genon-Catalot (2009) (see Section A4 therein). Using 



that E(Z| ) < +00, we can prove E(||/i^ 



't^c) < C/{nA). For this, we 



make use of the Rosenthal inequality [see Hall and Heyde (1980)]. 



Step 2. Study of E(||/i^ 



The proof relies on the following decomposition of 7^: 



ln{t)-ln{s) 



\t-h\\ 



hf + 2{t- s,h) --lji\t 



\t-h\\ 



2Vn{t-s)-2Rn{t-s) 
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where 

Mt) = l^{h* - nh*),t*), Rait) = - h\t*). 

As -fn{hm) = -\\h m\\ 5 we deduce from (3.10) that, for all in G 
ln{hrh) + pen(m) < -fn{hm) + pen(m). 

This yields 

\\hrh-h\\^ < llh-hrnll"^ +pen{m)-pen{m) + 2un{hrh-hrn)+2Rn{hrh-hrn). 
Then, for cpn = i^n,Rn, we use the inequality 

'^(pnihrh- hrn) <2\\hrh- hrn\\ SUp \(pnit)\ 

te5„+5A,||t||=i 

< -^ll^m - ^mll^ +8 sup |(/'n(t)p. 

» t65„+5A, 11*11=1 

Using that \\h^ — /im|P < 2||/i^ — Zip + 2\\hm — and some algebra, we 
find 

1 - 7 

T||/irn-^|P < -||/i-/im||^ + pen(m) -pen(m) 

(8.4) 

+ 8 sup |i2„(t)p + 8 sup Wn{t)\'^- 

ie5,„+5A,||i||=l teS„+5A, 11*11=1 

We have to study the terms containing a supremum, which are of different 
nature. First, for Rn{t), we have the following. 

Lemma 8.1. We have: sup,^s,r.+s^M=i ^ ^^'/-T™„ u^\h*{ur du. 

Proof. We have Rn{t) = ^{t*,{l - ■ijj\)h*). By using Lemma 2.2, we 
find 

sup \{t*,[l-i^l)h*)\^< sup |r,(l-^i)r)|2 

*eS„+SA, 11*11=1 tGS„^,||t||=i 

<27r||(l-V'i)/i*l[-.„„,™„]f 



/7rm„ 
'u2|/l*('u)|2(i'U. 
-■wrrin 



(8.5) 



On the other hand, Vn is decomposed: fn(i) = '}Zj=i ^n,j{t) + ''n(i) with 
1 

27rA^ 



^ (f , (v;S(n) - i^'L{umt\{u) - («))), 



27rA 
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and 



Lemma 8.2. We have: E{snpt^s,^^SrHM=i K{t)\^) < f • 
Proof. Using the independence of the subsamples, we can write 



e( sup |r„(t)|2 

<e( sup |r„(t)p 



^rnn ill 



= 1 



< 2^2A2 ^[II(^a!i - '^a)(V'a,2 - V'A)l[-^m„,7rm„]"^ 



^'\-V'A)(<^2-V'A)l[-.™„,.^„]f] 
/,(!) „;/ r„,M2iwriJ,(l) „// r„,M2i 



(8.6) + IK^S - V'a)(C2 - V'a)1[-.,„„,™„ 



-rn„ _ ^ ^^^^ 



~ ttA^ y rfi J ~ n 

because m„ < nA and IE(.Z^i) and E{Zf) have order A. □ 

Now, the study of the J^nj 's rehes on Lemma A.l. Let us first study the 
process We must spht Z| = Zll^i^^^^^ + Z^l^i^f^^^ with K to be 

defined later. This imphes that i^n,i{'t) = ^ni{^) + ^nii^) {P for Principal, R 
for residual) with 

n 

(8.7) ^ 

with /i(z) = ^zX,^^^^{t*,e''-ijA), 

and I'nliit) = fn,i(i) — '^?^i(*). We prove the following results for and 

t'n,2. 
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Proposition 8.1. Under the assumptions of Theorem 3.1, choose kn 
p(m,m')=4E(ZiVA)"'^"'' 



C , and 

In(nA) 



A ' 

then 

e( sup +e[ sup \v[''}{t)\' 

H<^Sm+SA,\\t\\=^ + '-tes„,„, 11*11=1 

~ nA ' 
where C is a constant. 

Proposition 8.2. Under the assumptions of Theorem 3.1, 

El sup K,2(i)]^ < — r, 

where C is a constant. 

For both 1^^1,3 and fn^, which are similar, we have to spht again = 
^k'^\Zf.\<k„\/A^ ^f''^\Zk\>kn\/A with the same kn as above. We define i^nj{t) = 
^nji^) + ^Sji^) ^s previously, for j = 3,4. 

Proposition 8.3. Under the assumptions of Theorem 3.1, define for 
j = 3,4 

(8.9) Q(m,m') = 4E2(ZiVA)!^, 

then 

e( sup [<,(t)]'-(?(m,m)) +e[ sup li^i^J.^Wp 

^^In^A)^ 
~ nA ' 

where C is a constant. 

Now, on rjg, the following inequality holds (by bounding the indicator 
by 1), for any choice of k: 

(1 - £i)peni/,(?Ti) < pen(m) < (1 + Q)ven^f^{m), 
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where penj;j(m) =E(pen(m)). It follows from (8.4) that 
1 - 7 

-E{\\h^ - hfln^) <^\\h- + peni/,(m) - E(pen(m)ln, 

u^\h*{u)\'^du 

■TVmn 



+ 8E( sup knWpln, 

^tGS™+5A,||t||=l 

Recalling that 



we have 



Ef sup Wnit)fln 



(8.11) 



je{i,3,4} 

+ Ie( sup |i^„,2(t)|^ln 



t6S™+5A,||t||=l 



< 8 + 2E[(p(m, m) + q{m, m))lf7 ■ 

We note that p{m,m') + q{m,m') = j;:{pen^f^{m) +penff^{m')). Thus, 

pen4;j(m) - E(pen(m)ln J + 128E[(p(?n, m) + q{m, m))lnj 

32 

< pen^ftXm) - (1 - g)E{penth{m)ln^) + — E[(pentft(m) + penj;,(m))ln, 

< (^1 + pen^Jm) + - (1 - £,)^ E[peni^(m)lf^J. 

Therefore, we choose k such that (32/k — (1 — £))) < 0, that is k > 32/(1 — g). 
This together with (8.10) and (8.11) yields 

Je(||/ia - hf In,) <l\\h- h^f + (2 - £>)pen,;,(m) 

u^\h*{u)\'^du + ^. 

-7rm„ nil 
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Proof of Proposition 8.1. Let m" = my m' , and note that Sm + 
Sm' = Sm" - We evaluate the constants M, H, v to apply Lemma A.l to f^i(t) 
[see (8.7)]: 



sup 1/4(2;) I < 



< 



j= sup 

27rv A zgM 



27r\/A 
k 



r(-u)e*"^?/;A(n) du 



\t*{u)\du< 



2ttVA 



27rm 



1/2 



\t*{u)\'^du 



{mr/'\\t\\ 



kr, V m" 



M. 



Moreover, 



e( sup _ [<i(t)]') < / ^^E(Zf)V'i(tx)dn 



~ nA ' ' 



The most delicate term is v: 



Y^T{ft{Z^)) = ^J^E(zfl^.<,^^|| I e-^ir(-x)^A(x)dx 

- 4^^(^'/ / e'^""'^^'**(-^)**(2/)V'A(x)VA(-y)dxdy 

= 4^2^2 jj ^A\^-y)t*i-^)i*iy)^^i^)'^^i-y)dxdy, 

where we recall that V'aH^) ~ E(Z^e*^^^). Making use of the basis {y^m",jd £ 
Z) of 5m", we have t = Y.jai'tj'Prn" ,j with = Y.jei'^] = 1> 

Var(/t(Zi)) < J"^ tjtk / / ilj^^\x-y)Lp*^„j{-x)ip*^„^f,{y) 



< 



2 A2 ( X] 



47r2A2 



X 'i/'A ix)tpA{—y) dx dy 

i'A^^ - y)'p*m",ji-x)^*m",kiy^ 

2\ 1/2 

X tpAix)ipA{-y)dxdy 
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47r2A2 



\i^l>{x-y)\'\^^{x)\' 

[— 7rm",7rm"]2 

X \'4^ii{-y)\ dxdy 



< 



1/2 

[—irm" ,-Km"Y 
\J[-27rm",27rm"] / 



Var(/i(Zi)) < ^4x2 f / / \i^^l\x-y)\''dxdy 

(8.12) 



47r2A2 ^„^__ 

Therefore, we need to study /[_2^m" 27rm"] I^A^ Recall that 0(m) 

z6 — /q" /i* (u) dv. We have 

where 



satisfy: J {u)\'^ du = |</''(n)| < and thanks to (H4), the Parseval 

equality yields 

\^^^\u)\'^du= [x^n^{x)dx= [x'^h^{x)dx. 



By assumption, h* is in Li(M), thus, \(p{u)\ < \b\ + \h*\i := M^. Therefore, 

IV'i') |2 < CA2(|0(3)|2 + A^icPy + + A\^'f + A6), 

where C is a constant depending on and \h\i. Therefore, 



27rm" 
-27rm" 



J x^h^{x)dx + A^(^j x^h^{x)dx + ATTm"\h\\ 



+ A4||/i||2+4WA'^ 



<C7iA2 



4|| i,l|2 



+ C2?n"A^ 
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Thus, using Assumptions (HI), (H3), (H4), 

[ \iP^l\u)fdu<K{A'^ + m"A^). 

J l-2nm" ,2nm"] 

As m"A^ < nA^ and nA^ < 1 we get /[_2W,2™"] I^aV)^^^^ < '^KA\ 
This together with (8.12) yields v = cVm" / A where c is a constant. 

Applying Lemma A.l yields, for = 1/2 and p{m,m') given by (8.8) 
yields 



E 



sup [z^^i(t)]2 -p(m,m'; 
teSm+s„,M\=i 



<Ci 



nA n^A 



as p(m, m') = 4i?^. We choose 



and as m < nA, we get 



E( sup 

Hes™+5„,,llt|l=i 



4 In(nA)' 



1 



nA 



(An)4ln2(nA), 

As C2xe~'-"^^ is decreasing for x > I/C2, and its maximum is l/(eC2), we 
get 



m'=l 



vV7> 1/(72 
-C2V^ 



m'<l/C2 



< 



^ 00 



< +00. 



m'=l 



It follows that 



V e( sup 



C_ 
nA' 



.',l|t||=i ^ + 

Let us now study the second term v^ -it) in the decomposition of f„j(t). 



The cases j = 3,4 being similar, we consider only v]^ ■ (t) for j = 1: 



E 



sup _ 1 1^15(^)1^ 

t^Srrin ' II ^11 
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- 47r2A2^ 



1 / />7rm„ 1 



X \ijj'^{u)\'^ du 



27r(nA)P/2 ' 



using m„ < nA and recalling that /c„ = (C3/4)(-^/n/ln(?iA)). Taking p = 2, 
which is possible because E(Zf ) < +oo, gives a bound of order ln^(nA)/(nA). 
Proposition 8.1 is proved. □ 

Proof of Proposition 8.2. For i'n^2, the variables are bounded with- 
out splitting, and the function ft is replaced by ft{z) = (27rA)~^(t*,e*^'V'A)- 
We just check the orders of M, and v for the application of Lemma A.l. 
For t G Sm" = Sm + Sm! and ||t|| < 1, we have 



sup|/i(z)|<^^/ / \t*{-u)\^du / \r^{u)\'^du 



A 

Next, 



< V^—-^ < CVmr := M. 



e( sup K2(t)]') < :^-r2 / IV'AWl'rf^ 

^ m"E2(Z2/A) _ 
~ nA ■ ■ 
Following the same line as previously for w, we get 

Var(/j(Zi)) 



AtT^A^ V././[-™",7rm."]2 



1/2 



As Va = ^(<?^' + ^'/'^IV'A, we get (recall that = \b\ + \h*\i is the upper 
bound of 10(^)1) 

1 z-™" , 2A2(||/i*||2 + 2WA2Af2) 

Var(/,(Zi)) < / |^:^(x)pdx< 



47r2A2./_w' 47r2A2 
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1 9 o 9 + 
< -{\\hf + MV„A2) < ^ 1 := y 



TT ^ TT 

.2 ^ ^ A3 



as m„ 

Proof of Proposition 8.3. Here, ft is replaced by ft{z) 



z 



l|.|<fc;VA(**'e"'^k)- Using now that \i;'^{u)\ <K{\Zi\) < ^/E{Zf), we ob- 
tain here that M = k'^yj m" \J^(Z\I On the other hand, we find iP' = 
m"E2(Zf)/(nA2). Last, we find 



Var(/t(Zi)) 
47r"2A2 

p27rm" 



< 



\tl;^l\u-vf\^M\^\^'^{-v)\'dudv\ 

[— 7rm.",7rm"]2 / 



With the bounds for jV'^l and JJ^l^^// lip'^iz)]"^ dz, we obtain u = cE(Z^/ 

8.5. Proo/ o/ Proposition 4-2. Let us take m = 0((nA)^/(^'^+^)). When 
p£C{a,L), the first two terms of (4.4) are of order 0((nA)-2"/(2a+i)^^ rj,j^g 
third term is 0(A^m^(^~")+). If a > 2, its order is A^ and is less than l/(nA) 
if nA^ < 1. 

If a E (0,2), A2?n2(2-«) = 0(A2(nA)2(2-«)/{i+2a)) ^^ich has lower rate 
than 0((nA)-2'^/(2'^+i)) if A2(nA)4/(i+2«) < 0(1), that is nA^+'^^+^'^y^ = 
72/\3/2+a < 0(1). We must consider in addition the terms A^m^ and A^m^. 
As previously, A^m^ < (nA)-2"/(2'^+i) if nA^6a+5)/(2a+3) < q^;^) ^j^^^^ 
nA5/3 < 1 if a > and JiA^ if a > 1/2. Moreover, A^m^ < (nA)-2«/(2«+i) jf 
„^(i0a+ii)/(2a+7) < ^^^^ nA"/"^ < 1 if a > and nA^ < 1 if a > 1/2. 

8.6. Proof of Proposition 4-1 ■ As previously, ||pm— p|P = 27(ll?'*~?'mlP + 
WPm ~Prn\\'^)- The Variance of satisfies 

E{\\pm-Pmf) = ^mP*m-P*mf) 
1 

~ 2^ 

where 

E{Zf) _ E(Zf/A) 
nA2 nA 

We have \h*{u)\ < \h\i. By Lemma 2.2, \^{u)\ < |6| + |u|(|/i|i +^2) < 0(1 + 
|n|). Inserting these bounds in (4.3) implies 

\Eip*{u))-p*{u)\<CA\p*{u)\\u\{l + \u\) 

(8.13) 

+ 0'A(l + |n|) + 0"A2(l + |n|)3. 



(Var(p*(u)) + \E{p*{u))-p*{u)\'^)du, 

m 

\&t{p*{u)) < 
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Gathering the terms gives the announced bound for the risk of pm- This 
ends the proof of Proposition 4.1. 

8.7. Proof of Theorem 4-i- The proof follows the same lines as for the 
adaptive estimator of h. We introduce, for < g < 1, 

Provided that K{Zf^) < oo, we can make use of the Rosenthal inequality to 
obtain: 

mPrn-pflni)<C/nA. 

For the study of E(||p^ the decomposition is similar to the pre- 

vious case [see (8.4)] where hm,h are now replaced by Pm,P- The processes 
Rn(t) and i^n(t) are given by 

Mt) = ^{p*-E{p*),t*), R^{t) = ^mi -p*,t*). 

The term Rn{t) is dealt using (8.13). For the term containing fn(t), we need 
apply Lemma A.l. So, Vn is split into the sum of a principal and a residual 
term, respectively denoted by and with 

1 " 

(8.14) ^ 

with ft{z) = ^z\^,^,^^{t*,en, 

and f^(t) = I'nit) — J-'nit)- Everything is analogous. The difference is that, 
for applying Lemma A.l, we have to bound /^27rm" IV'a^(^)P^^^ (instead of 
/-27rm" I^A^(^)P previously). Using tp'^ = A4)Tp/\ [see (2.1)-(4.2)], we find 

+ A^'iljA[15(t)^^^'^ + 60(1)" {^'(u) - a^)4> + 15((/)'(n) - a^f] 
+ A [17(1)" + 360(2) (^/(^) _ ^2)2j 

+ 12A^^a4>H^'{u) - a^) + AVa<^^. 

Now, (j){u) < C(l + |n|) and all the derivatives of (f>,(f> are bounded. More- 
over, under (H6), J \(l)^^\u)\'^ du = J x^\p{x)\'^ dx < +oo. Thus, we find the 
following bound: 

/2nm" 
IV'J^ 1^ < CA'^il + A'^m^ + A^m^ + A^m'^ + A^m^ + A^^m^^) = 0{A^), 
-2nm" 

as m < \/nA. The proof may then be completed as for /i^. 



[{l/inA)mUZ^] 
(E(Z6/A)) 



< 
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8.8. Proof of Proposition 5.1. Proof of (i). The assumptions and the fact 
that r < 1 imply 



r 

|rAr= " 



Taking expectations yields EIFa^ ^ ^/ ItT^It) ^^7- 

Proof of (ii). Consider / a nonnegative function such that /(O) = 0. We 
have 



E /(X, - ) = E ^ /(Sr. - Br, 



s<t s<t 



Then, E.<i E/(Sr. - 5r._ ) = Es<t /m /(^) (Ee^^^/^^^-^--)) ^^^^^;_^^_^ ) dx. 
Since, for all x, 

we get the formula for nx- Setting rua =E|X|", for X a standard Gaussian 
variable, yields 

r+oo 

\x\°'nx{x)dx = ma I j^^'^nrij) d'y. 
Jo 

Thus, E|XAr = m,E(r^/'). As r/2 < 1, P^/ = {Zs<A^s - T.J''/^ < 
Ss<a(-'^« — Ts_y'^'^. Taking expectation gives the result. 

Proof of (iii). The result is proved, for example, in Barndorff-Nielsen, 
Shephard and Winkel [(2006), Theorem 1, page 804] [see also Ait-Sahalia 
and Jacod (2007)]. 

8.9. Proof of Proposition 5.2. We have E(Zfc) = Ab and, for £ > 2, E(Zf ) = 
Ace + o(A). Therefore, b is an unbiased estimator of b and, for I > 2, 
^/nA\Kc£ — q| = \/nAO(A). Hence, the additional condition nA'^ = o(l) 
to erase the bias. 

Setting ci = 6, ci = 6, as Var = Ac2^ + o( A) for £> 1 , we have n A Var q = 
C2e + 0{A). Writing V^(c,-Ec,) = (nA)-V2 ELi(^f-E^D = ELiXfc.n, 
it is now enough to prove that ^^^=1 E|xfc,n|^~'"^ tends to 0. Under the as- 
sumption, we have 

VE|yfc„|2+- < _-^-_(E|Zfc|^(2+.) |E(4)|2+^) < — 
which gives the result. 
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8.10. Proof of Proposition 5.3. The study of (5.3) relies on the following 
result which is standard for r = 2. 

(r) 

Lemma 8.3. Let Yt = 9t + aWt for a constant and consider dn = 



Then, for all r, \/n{an^ — a"^) converges in distribution to a centered 
Gaussian distribution with variance a'^'^ {m2r / ml — 1) as n tends to infinity, 
A tends to 0, nA tends to infinity, and nA^ tends to 0. 

Proof. We have E^n ^ = ^E|6'\/A + cjX|'', for X a standard Gaussian 
variable. Thus, 



rur J a\j2'K 

Noting that e^"^/'^' - 1 = Ouy/K/a'^ + A Y.n>2 A"/2-i and that 

/ \uY ue~'^'^ / ^'^'^^^ (in/((T\/27r) = 0, we easily obtain 

lEai''^ - an < cA. 

Thus, ^/^\mP-a''\ = o(l) if V^A = (nA2)i/2 = ^(i). Noting that E|6'\/A + 

aX\^ converges to a^ruk as A tends to 0, we get nYaidn^ — > cr'^'''{m2r/ 
ml - I). 

Finally, we look at Xfe,n = ^"ni^^/A + (j(WfcA-VF(fc„i)A)/VA|'--E|0VA + 

aXY), which satisfies nEx^ „ < c/n^ . Hence, \/n{an^ — 'Kdn^) converges in 
distribution to the centered Gaussian with the announced variance which 
completes the proof. □ 

Proof of (i). As noted above, Lt = hot + aWt + Tt with 6o = ^ - 
/ xn{x) dx. Using that, for r < 1, 1 1 X] + hiY — \ Yli'^iY\ — Yl l^iT) S^t 
|o-n - \ < Efc=i l^fcA - r(fc_i)Ar', where aij^^ is built with Yt = 

bot + aWt as in the previous lemma. Thus, applying Proposition 5.1(i), 



rUr J 



x) dx. 



Since r < 1, the constraint nA^"'' = o(l) can be fulfilled and implies nA^ = 
o(l). Hence, the result follows from the previous proposition. 

Proof of (ii). The proof is analogous to the previous one [using Propo- 
sition 5.1(ii)] and is omitted. As cr(r) = [o"!'"^]"^/^, we conclude for cj(r) by 
using the delta-method. □ 



ESTIMATION FOR LEVY PROCESSES 



35 



APPENDIX: THE TALAGRAND INEQUALITY 

The following result follows from the Talagrand concentration inequality 
given in Klein and Rio (2005) and arguments in Birge and Massart (1998) 
(see the proof of their Corollary 2, page 354). 

Lemma A.l (Talagrand inequality). Let Yi,...,Yn be independent ran- 
dom variables, let i^nxif) — (1/^) Sr=i[/(^«) ~^ifO^))] '^''^d, let T be a 
countable class of uniformly bounded measurable functions. Then for > 



E 



sup|j/„,y(/)|2-2(l + 2e2)ii'2 

< J^^-Kie'^nH^/v , 98M^ -2KiC(e'^)e/{7V2)nH /M 



n 



Ki \n Kin2C2(e2) 



with C{e^) = \/l + e2- 1, Ki = l/6 and 



sup||/||oo<M, E 



sup|z.„,y(/)|l <H, sup-VVar(/(n))< 



k=l 



By standard density arguments, this result can be extended to the case 
where J-" is a unit ball of a linear normed space, after checking that / i— ?• I'nif) 
is continuous and J- contains a countable dense family. 
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